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<151> 1999-12-01 

<160> 17 

<170> Patentin Ver. 2.0 

<21()> 1 
<211> 3632 
<212> DNA 

<2 13> Phaffia rhodozyma 
<:22();- 

<221; - 5'UTR 
<222> (922).. (923) 
<223> EXPERIMENTAL 

<220;- 
<221> exon 
<222> (986).. (1096) 

<22{)> 

<221> intron 
<222>( 1097). .(1306) 

<220> 
<221;' exon 
<:222>( 1307). .(1456) 



<220> 

*: 221> intron 
<222> (1457). .(1555) 



■ :220^ 

• -22 1 exon 

■ 222 - (1556). .(1589) 

• :220 ■• 

• -22 1> intron 
':222> (1590)..(1()94) 

"'22(}> 

■ -22 1 ■ ■ exon 
•:222>( 1695).. (1799) 

<-:22()> 

' 221 -"' intron 
<222> (1800).. (1920) 

<'220> 

' 22 1> exon 

<222>( 1921). .(1982) 

■ 22()> 

■■ 22 1> intron 
<222>(1983)..(2()7()) 

■ 221 )> 

'■ 22r-- exon 

<:222> (2()77)..(214()) 

':22()> 

''221:-- intron 

■ 222:: (2141 ).. (2246) 

<22(.)> 

^221- exon 

--;222>(2247)..(2272) 

<22()> 

' 221 ■ intron 

<222: - (2273)..{2390) 

<22( i:-^ 

<22V-' exon 

<222> (2391).. (2507) 



<220> 



<221> polyA _site 
<222> (2663).. (2664) 
<223> EXPERIMENTAL 

<4()0> 1 

tcctgtlgat aatctttcta acgccttgta ctttgaccaa ggcgtttgtc cgaaattttg 60 
caaacttagt gttggtcgca tggacggtct tcggatccag aactgacggc tcgccaataa 120 
agtatgacga tggtagaggt gaaggaggga accacaggtt gaccagtctc aaagagtgct 180 
gatgtgcgcg aggatttgtc attaaatggt gttgtatatg ctagagccaa gagaagacat 240 
ttggttttgg ttttggtttt gcatttgatg agatgtgtca cgattgaaga cgggaggagg 300 
ctcactaacc caagaagcca ggatcaggag gaatgcctcc cccttttcat caagatcttt 360 
ctcacatcga acatttgaca ttctctttag tatccttcta tccttttctt ccaacttctc 420 
ccattgtatc gactttgctc gacttgctct tcttatctct gagcagagat gggcattcca 480 
atatcgaagg agcgacacaa gaccttggag tttgggtaac agatgaagag gggccgaggt 540 
ggatggggct gtaggaagta gctgatcgat gagttcctgg atgatgatag gcgaaggaac 600 
agacatagga tctctgtctc gtcctggaat tactgagtct tgtatccagc gtgttcttgt 660 
ctcgaagaag ccttcaagat cgatgtaaga taagacaggc aatgaggacg gacgaacgaa 720 
cgaacgaaaa gaacagaaga gctggtaagt cagtcagtca gtcagtcagt caatcaaaca 780 
ctggtgtcta gggttatagc tcgacgcgac gcgacgcgtt tgagacgcga tatgcttacg 840 
taatacctgg cgtcatcccc ccagccgagg caagagccga gccgctcgtg aacgacaaaa 900 
ttcaaaaggc tttctccatc ttaagctcat tctcatctaa ccgactcatc tcgttcccat 960 
cattcccatc attctaccgc catcc atg tct gtt cga gca tec etc tct tec 1012 
gtg tct aga cag act ttc gtc get cct get get ttc cag ate agg gca 1 060 
aag cat ace ctg cct gag ctt cct tac get tac gat gtaagacttt 1 1 Oo 
tecgtgttct cctattegte getttcttgg tttttttcgt cttcgccctc tagctcttct 1 166 



icgtcctttc tgtcclgctc tttgtlgttg atattcagct cgatagacta acccatctca 1226 
tctcctggac attcttttac tggaaacgta tcttgtcctt ggtttllctt ggctttggtt 1286 
gaaaattcct ctccactcag gcc ctg gag ccc tec ate tee aag gag ate atg 1339 
aee ctt cac cae ace aag eac cat eag act tat gtt aac ggc etc aac 1387 
get gcc gag gag age tae teg gee get gtg ggc aag gag gat gtg ctt 1435 
acc cag gtt aag ctt cag tet gtacgtctga ccgttttttt atcgaecgga 1486 
aegcctggtg aggagggaga tgaagtttga tgagegctca tcgtctagca cgttgacccg 1546 
atcatacag get etc aag ttc aac gga gga gga cac ate aat c 1589 
gtcagtgata ttetteaaac tcttgetgag caagteaggt caagctgact gtttegettt 1 649 
gtttctgegg atetatctea tcettgattt ggcatgatga aacag ac tet etg ttc 1 705 
tgg aag aac ttg get ccc tal gga tee gag gag get ace etc tet gaa 1753 
gga cet etc aag aag get ate gag gaa tet ttt ggt tet ttc gag g 1 799 
gtcegtccat etatcttcet atteagttgt gtttggttcc ggtatactea tetgttttgt 1859 
ttccceacaa aataaaaata aaaatcttgt ectctccggg ggttcgactg caegtteata 1919 
g cc ttc aag aag aag ttc aac get gac acc get get gtc caa gga tec 1967 
gga tgg ggc tgg ctt gtatgtatca tatcctttec atctcaaact ettcteagag 2022 
tctttttect tgagacttca aactgactat acatgtttct acaacaaaea acag ggc 2079 
ttg aac ceg ctt act aag aag ctg gaa gtc acc acg aee gcc aac cag 2127 
gac cct ctg ctt a gtaagttgtt tctaeatgat tttetatcte aaegcgatct 21 80 
geatgattcg teactgatte actggattet cttgtltegt ttttetcggg atgattteat 2240 
aaacag ct cac att cet ate ate gga gtt gac gtgegtatet ttettgaata 2292 
gtegtagcgt ctgatctcgt tttattgact gacgtgttgc ttctgtceaa atcattaaaa 2352 
aaaatgaaaa caaataateg attgaccgae gaaaacag ate tgg gag eac get ttc 2408 



tac ctl cag tac aag aac gtc aag cct gac tat etc get get gtt tgg 2456 
tee gtt ate aac tac aag gag gca gag gee cga ttg cag get get etc 2504 
taa gcgggacgaa aagtaaegae atatgaaggg aggatcaaat atcgtttctt 2557 
cataaacaae tttegaggea gatgggagag tacgtacaag agaggtttgt atggagaatt 2617 
gagtttgttg acggttagca ggttatgata tatgtagcta tagtctagtc taaatctgaa 2677 
agaagagaac aagatggttt gtccgaagag attgagagat eaagcecggt eatctgatgt 2737 
egaacaaaea tgeectggtc tgccaaeagt ttctagcaca ttatgaccat gttcatgtgt 2797 
aaattgggaa atgagccaga aaggtttatt atctaattca ttgattcatg cgactatgga 2857 
tacatatggg atttccagaa caaacagatg caacaaagca cggcattttc caaagatcga 2917 
gtcctcccac aagtatgcgg caaggtttgt tgttaagaga tataaaagca gacgacaaaa 2977 
caaatcgttt atcgaccctg tgcaccaaca ccgtgaccgt ttgacgagtt ggtagagttg 3037 
tagttgttgc tgtteaaagg agctccagac tggacgcttc caagcttcaa eaacttctcg 3097 
gcagcgtcgc tgttegggaa aagaaaaggc aaaaaggaac agagcgataa gcatatgtga 3157 
ttctctactt cttataggct cttagctcaa gteaactcac atgtctttgg cggtaccgaa 3217 
gacgttctca agctgctget tggaagcttt tcegagcttg ecagtaggtc cetggttgga 3277 
gaagaagatg tcgaaggcta agggcgatga aaagcatgaa gatattagct atcggcgcga 3337 
taaaagtgtg acgagatgaa aatggagaaa agatgattcg caccategac gacctcgacc 3397 
aaaggaatgg aggtgtcacc ggccttccac ttettgtact cctcaacgtt gacgaagatg 3457 
acgaageagt cggtggcett agcctcgggc tcgtagatgc tgatgaaaca caataggtag 3517 
taggagagga gaaagagaag atgatgagat gtcaggatgc ttgcttcact gtagatggag 3577 
gaagaagata tgcgaagcaa gaeatacact ttggaaagag cttgaaccat tgtag 3632 



:210>2 



<21 1 > 3375 
•:212 • DNA 

<213 • Phaffia rhodozyma 
;22()-' 

;221 • 5'UTR 

'222 m974)..(975) 

:223 ^- EXPERIMENTAL 

<22()> 

<221> cxon 

<222-> (1040).. (1063) 

<220> 

<22 1 > intron 
<222> (1064)..(1237) 

<:220> 

<22 1 > exon 

<222>(1238)..(:i280) 

<22()> 
■221- • intron 
:222>( 128 !)..( 1358) 

<22()> 

<221> exon 

<222>(1359)..(;i402) 

<22():- 

• '22U - intron 
<222-Ml403)..(:i4S3) 

<22():- 
<221-- exon 
<222:-{ 1484).. (1589) 

<22(); • 

<221 • intron 

<222: - (1590).. (1674) 

<220> 

''221> exon 

•'222>(1675)..(1826) 



<22()'- 

' 221 • intron 

■022 •(1827). .(1007) 

• :22() ■ 
■221 • cxon 
<222-(1908)..(n)22) 

<220 • 

^;221 intron 

<222- (1923 )..( 1992) 

<22()::- 

' 22 1 - exon 

-:222> (1993).. (2056) 

<220-> 

221> intron 
<222> (2057)..(2131) 

<22():- 

<22 1 > cxon 

<222:m2132)..(2157) 

<;22() 

■■ 221> intron 

< 222 >( 21 58 ).. (2238) 

- :22()> 
<.221 > exon 

<222 > (2239).. (2293) 

- 22(.) ^ 

•■ 221 - intron 

<222 > (2294).. (2376) 

' 22(l > 

•■'221 cxon 

<222> (2377).. (2426) 

• 220> 

• :221:-' intron 

•■222 ' (2427). .(2524) 

• 220 ■ 



:222> (2525),. (2542) 
220> 

221;> polyA_site 
222> (2667). .(2668) 
223> EXPERIMENTAL 



<4()0> 2 

cttatccltc tgccgctggt ctctgtctgt cgagtgtgtg ggatggtttt ggatatgttc 60 
ctatacgaaa ggtagcgcag agcaaagctg acagtattaa gcaagacaag agcttctttc 120 
tgttgacaga tgaaaggacg aactatgaag ctgtccatgc tccccaaacc gattgacaca 180 
ccgccgtcag gcaacgcaga atttctcact gcUcgacgt cacaccaaca tcgatcctcc 240 
atacctaaaa gcagatcgag acacattgtt ggtcgccatg ttggatggat gtacatcaaa 300 
cccacagcat atatcactca catgtgagaa ctccgtagcc tctaccttct tgtctctcaa 360 
tctgaatgtc tcgttgagag gtggaalgaa tgtttacagt ttgagaagac gaaagaaaga 420 
aagagaagag aagagaggaa tacgtacgac gaagttatca tcgtatggga acttttctaa 480 
aaaactgcct atagtagaga cgatctctgg aggaaagctc tgtagtatga tagtgaagag 540 
cgagcaagtc tgggcaagtg catccttcgt ctacaagaaa gagaccagga aatgaaggag 600 
agacgagtaa gcaggtacct accgatattg gatcgttctc tctacccagc gatgccttca 660 
ccacgcgttc tatctcttct tgggatggca gatacatact taacgagagc aatctgatgt 720 
ataccgaact tcgaacggaa tgatcccaga atcctcttga acccttgaac ccttgaaccc 780 
tggaaccaag taccaaccga gcaacacgcc gatacggtcc acaccacaga accacacgcc 840 
ctcgtcatta aaggtgggac gcgccgatgc Iggttacgtt cggcccaatc cggaagttac 900 
cggcttggac gtgcctgtaa ccatgccctg acggtatttc gccttcagct aactccatct 960 
catctttttc ctttactacc acaacccacc cttgaacctt cttccccggc ttttttacta 1020 
tatccatcta tcaatcatc atg get cct tac act ctt ccc gac gtaagcttaa 1073 



agtttgagct gtgtgtgctt atctcaatct tggagttgaa ctcaccgttt tttgttlttg 1133 
cttcctggtt tttttatcgg catccctcct ttttttcccc tcgtggtcgc atatgalUg 1 193 
ctcatcaatc ggcgtttccc atgcatcttt gtcatccgtt tcag ctt cct tac get 1249 
tac gat gcc ttg gag cct tac ate tct aag g gtgagattct tagtcagact 1300 
gttgttccgg ttcgacacga tagctaatcg tctctcgttc ctcaatatga aeatgcag 1358 
aa ate atg ate ett cac cac tec aag cac eat cag act tac gtc 1402 
gtaegtaatc taaaggteat etccgtctac atggccggat caacttgcte atagatcttc 1462 
ettctgtteg gcgetaegta g ace aac etc aac gee get ate cag get tte 1513 
tec cag acc aat gae ate aag gee eag ate get ett cag age get etc 1561 
aag tte aac gga gga gga cac ate aac c gtaegatcat tctecetett 1609 
ctggettate atatgtgttg ettgtcacta aeaegcatgc aaceecggga tatetcaecc 1669 
tgtag ac tec etc tte tgg aag aac atg get cct gee gac tct get gat 1718 
gee aag etc acc gag gga teg etc aag act gcc ate gae aag gac ttt 1766 
gga tee tte gag gag tte aag aag aag tte aac act get act etc ggt 1814 
gtc cag gga tct gtcagtatct cgtttgctte gaeatactet cagctttcet 1 866 
tccgtaaaet gacgaatagt tttteggaca tgtaettgta g gga tgg gga tgg etc 1922 
gtgcgtttga cettttteca etttgaacat tagegatagt gataeetaac aactgtgaat 1982 
tggaatatag gga tac aac ace get ace aag cac etc gag ate gcc acc 203 1 
acc gee aac cag gat cce ett ate a gtatgtgact tetctegtgt 2076 
ggtcaecata agecagttgc tgaeacattt cgttcgctgt etctcgactt cgtag et 2133 
ttg act cce ate att ggt ett gac gttagtaatt ctatetagtg attggagtcg 21 87 
agttctgaae ttgeettgat cteaaaegaa tgaatcaatt tcttttggta g ate tgg 2244 
gag cac get tte tac etc cag tac aag aat gtc aag cct gat tac ctt g 2293 



gtacgtaatt ctctaltcgt ttgccccggt ttgatclltg actcactctt caaaalgttt 2353 
tcgtttgta actttgaaaa acag cc get ttc tgg aac glc tgc aac ttt get 2405 
gag get cag cga agg ttt gat gtgagtacag gcgctacccc tacggaggaa 2456 
gcgaaggtga gctgaccact ttttatcttt ctgatttgga atgaacgatc cgatgatcaa 2516 
acaaacag get get gtc aag get taa tggtcccatt tatctctttg attegacggc 2572 
gatgaeggct ttctcgcatc cgaagaaggc aaggctatga ttactgttat tctgecatgt 2632 
tgcttgcttt gctatgctct atgttctttt cttttgcctc tcttcaaagc caaggcgtta 2692 
aggaaggccc ttcagtctgt tttacatatg eacatataca tgagaaeata tcacggactc 2752 
ggcggctggt ggtcctcttg agcgtcggct tcaagattag tgtccacacg tgaagcgttc 28 12 
ggtgceatcc aacctggtag gaatccccat cgggcgggaa tccaattatc aattggcggt 2872 
cggccagatt cgagctcggg tatcteagaa gcgtcaagcg ggcgcatttc caggccttta 2932 
^g^ggagcaa atttaatccg cctgggtgtt cagcgagaca cgaacagttt gaaacagagt 2992 
ctgettgtga gttactcggc gagatcactg aggactaaac tttctcagct cgtggacgaa 3052 
aagaacgaac eaaacggtct tecctgtatc tcgaccatct ccttctccat ctcttaeaac 3112 
acctcggatg aactccaagg cttgctttcc aaagttcaaa caaactccgg gttgccatcc 3172 
acctggtttg tctctaacga gccgagggat atccatcgtt cggaacgttt gaacagactg 3232 
gatggtaggt ggccggtegc tteggaagcc aatcataatg gtgggaatcg agagaaggaa 3292 
tgattgggcc cagtgtttaa gaettgtgtt tgttggcaga gtacggacgg aaagtaggac 3352 
agacttaatc aaggcgagcc aag 3375 

<210>3 
<211>951 
<212> DNA 

<213> Phaffia rhodozyma 



<220-- 
<221: ' exon 
<222>(1)..(34) 

<220 ' 

<221> intron 
<222-{35)..(115) 

<220> 

<221> exon 

<222>(117)..(159) 

<220> 

<221 > intron 
<222.>(160)..(242) 

<220> 

<221> exon 

<222>(243)..(363) 

<220> 

<221> intron 
<222>(364)..(436) 

<22(J> 
<22 1> exon 
<222> (437)..(951) 

<400> 3 

tec gga age tea gat ace get cga gat cet cga g gtttgtgtgc 44 
tttci>ctttg ttcgcatgga tgaagctgtt aacttaaaaa aatcctcgtg tttctctttg 104 
tttcaacata g gt ttc tct ctt aag gtc aag acc tct gag gga aac tgg 1 53 
gac ttt gtacgtattc ttatcgactg agtcateaag ctcgttatcg ctctettace 209 
ctcatccttt tgtgtctctg tctacacctc tag gtc gga aac aac act ccc ate 263 
ttt ttc ttg aga gac eca gee aag ttt ecg ate ttc att cac ace cag 3 1 1 
aag agg aac ecg cag aca aac tct aaa gac aag gac get ttc tgg gac 359 
lae c gttcgtata acettgteae tccctgcgtg ccgetctgat tcatgttgac 412 



cttgtctttg alalaatltt atag ta tec caa aac ccc gag tec gtg cat cag 4()5 
gtg ctg cac ctg ttc agl gat cga gga acc cct get tct tac cga cac 5 13 
atg cat ggt tac tct gga cac ace ttc aag atg gtc aac agg aac ggt 561 
gac tgg aat tat gtc cag att cac atg egc acc gat cag ggt gtc aag 609 
act cac acc aat gaa gag get teg aaa etc gac gee tec aat ccc gat 657 
tea aac gga gac gac ttg ttc gac gea ate aag aat gga gac ttc cct 705 
age tgg acg gtt cag gta cag gta atg tct cct gag cag gee cag aag 753 
ttc aga tac aac att ctg gat etc acc aag gtc tgg tec cac aag gag 801 
ttc cca ett agg acg att gga aag ttc act ttg aac cga aac gtg gat 849 
aac tat ttc gea gag gtt gaa cag etc gee ttt get cct tec cat ctg 897 
cct cct gga ate gag ccc teg aac gat ccc gtc ctt cag get cga eta 945 
ttc tec 951 



<210>4 
<211>669 
<212> DNA 

<213> Phaffia rhodozyma 

<220> 

<221>CDS 

<222>(1)..(669) 

<400> 4 

atg tct gtt cga gea tee etc tct tec gtg tct aga cag act ttc gtc 48 
Met Ser Val Arg Ala Ser Leu Ser Ser Val Ser Arg Gin Thr Phc Val 
I 5 10 15 

get cct get get ttc cag ate agg gea aag eat acc ctg cct gag ett 96 
Ala Pro Ala Ala Phe Gin lie Arg Ala Lys His Thr Leu Pro Glu Leu 
20 25 30 

cct tac get tac gat gee ctg gag ccc tec ate tee aag gag ate atg 144 
Pro Tyr Ala Tyr Asp Ala Leu Glu Pro Ser He Ser Lys Glu lie Met 



35 40 45 

acc ctt cac cac acc aag cac cat cag act tat gtt aac ggc etc aac 1 92 
Thr Leu His His Thr Lys His His Gin Thr Tyr Val Asn Gly Leu Asn 
50 55 60 

get gee gag gag age tac teg gee get gtg ggc aag gag gat gtg ctt 240 
Ala Ala Glu Glu Ser Tyr Ser Ala Ala Val Gly Lys Glu Asp Val Leu 
65 70 75 80 

acc cag gtt aag ctt cag tct get etc aag ttc aac gga gga gga cac 288 
Thr Gin Val Lys Leu Gin Ser Ala Leu Lys Phe Asn Gly Gly Gly His 
85 90 95 

ate aat cac tet ctg ttc tgg aag aac ttg get cec tat gga tec gag 336 
He Asn His Ser Leu Phe Trp Lys Asn Leu Ala Pro Tyr Gly Ser Glu 
100 105 110 

gag get acc etc tct gaa gga cct etc aag aag get ate gag gaa tct 384 
Glu Ala Thr Leu Ser Glu Gly Pro Leu Lys Lys Ala lie Glu Glu Ser 
115 120 125 

ttt ggt tct ttc gag gee ttc aag aag aag ttc aac get gae ace get 432 
Phe Gly Ser Phe Glu Ala Phe Lys Lys Lys Phe Asn Ala Asp Thr Ala 
130 135 140 

get gtc caa gga tec gga tgg ggc tgg ctt ggc ttg aac ccg ctt act 480 
Ala Val Gin Gly Ser Gly Trp Gly Trp Leu Gly Leu Asn Pro Leu Thr 
145 150 155 160 

aag aag ctg gaa gtc acc acg acc gee aac cag gac cct ctg ctt act 528 
Lys Lys Leu Glu Val Thr Thr Thr Ala Asn Gin Asp Pro Leu Leu Thr 
165 170 175 

cac att cct ate ate gga gtt gac ate tgg gag cac get ttc tac ctt 576 
His He Pro He He Gly Val Asp He Trp Glu His Ala Phe Tyr Leu 
180 185 190 

cag tac aag aac gtc aag cct gac tat etc get get gtt tgg tec gtt 624 
Gin Tyr Lys Asn Val Lys Pro Asp Tyr Leu Ala Ala Val Trp Ser Val 
195 200 205 



ate aac tac aag gag gca gag gee cga ttg cag get get etc taa 
He Asn Tyr Lys Glu Ala Glu Ala Arg Leu Gin Ala Ala Leu 
210 ' ' 215 220 



•:210>5 

■:211>222 

<212>PRT 

■:213;' Phaffia rhodozyma 
•:400> 5 

Met Ser Val Arg Ala Ser Leu Ser Scr Val Ser Arg Gin Thr Phe Val 
15 10 15 

Ala Pro Ala Ala Phe Gin lie Arg Ala Lys His Thr Leu Pro Glu Leu 
20 25 30 

Pro Tyr Ala Tyr Asp Ala Leu Glu Pro Ser lie Ser Lys Glu lie Met 
35 40 45 

Thr Leu His His Thr Lys His His Gin Thr Tyr Val Asn Gly Leu Asn 
50 55 60 

Ala Ala Glu Glu Ser Tyr Ser Ala Ala Val Gly Lys Glu Asp Val Leu 
65 70 75 80 

Thr Gin Val Lys Leu Gin Ser Ala Leu Lys Phe Asn Gly Gly Gly His 
85 90 95 

He Asn His Ser Leu Phe Trp Lys Asn Leu Ala Pro Tyr Gly Ser Glu 
100 105 110 

Glu Ala Thr Leu Ser Glu Gly Pro Leu Lys Lys Ala He Glu Glu Ser 
115 120 125 

Phe Gly Ser Phe Glu Ala Phe Lys Lys Lys Phe Asn Ala Asp Thr Ala 
130 135 140 

Ala Val Gin Gly Ser Gly Trp Gly Trp Leu Gly Leu Asn Pro Leu Thr 
145 150 155 160 

Lys Lys Leu Glu Val Thr Thr Thr Ala Asn Gin Asp Pro Leu Leu Thr 
165 170 175 

His He Pro He He Gly Val Asp He Trp Glu His Ala Phe Tyr Leu 
180 185 190 



Gin Tyr Lys Asn Val Lys Pro Asp Tyr Leu Ala Ala Val Trp Ser Val 



195 



200 



205 



Ic Asn Tyr Lys Glu Ala Glu Ala Arg Leu Gin Ala Ala Leu 
210 215 220 



<210>6 
<211>597 
<212> DNA 

<213> Phaffia rhodozyma 

<220> 

<221>CDS 

<222>(1)..{597) 

<400> 6 

atg get cot tac act ctt ccc gac ctt cct tac get tac gat gee ttg 48 
Met Ala Pro Tyr Thr Leu Pro Asp Leu Pro Tyr Ala Tyr Asp Ala Leu 
1 5 10 15 

gag cct tac ate tct aag gaa ate atg ate ett cac eac tec aag cac 96 
Glu Pro Tyr He Ser Lys Glu He Met He Leu His His Ser Lys His 
20 25 30 

cat cag act tac gtc acc aac etc aac gee get ate cag get ttc tee 144 
His Gin Thr Tyr Val Thr Asn Leu Asn Ala Ala He Gin Ala Phe Ser 
35 40 45 

cag aec aat gac ate aag gee cag ate get ett cag age get etc aag 192 
Gin Thr Asn Asp He Lys Ala Gin He Ala Leu Gin Ser Ala Leu Lys 
50 55 60 

ttc aac gga gga gga eac ate aac cac tec etc ttc tgg aag aac atg 240 
Phe Asn Gly Gly Gly His He Asn His Ser Leu Phe Trp Lys Asn Met 
65 70 75 80 

get cct gee gac tct get gat gee aag etc aec gag gga teg etc aag 288 
Ala Pro Ala Asp Ser Ala Asp Ala Lys Leu Thr Glu Gly Ser Leu Lys 
85 90 95 

act gee ate gac aag gac ttt gga tec ttc gag gag ttc aag aag aag 336 
Thr Ala He Asp Lys Asp Phe Gly Ser Phe Glu Glu Phe Lys Lys Lys 
100 105 110 



ttc aac act gel act etc ggt gtc cag gga tct gga tgg gga tgg etc 384 
Phe Asn Thr Ala Thr Leu Gly Val Gin Gly Ser Gly Tip Gly Trp Leu 
115 120 125 

gga tac aac acc get acc aag cac etc gag ate gee ace ace gee aac 432 
G\y Tyr Asn Thr Ala Thr Lys His Leu Glu He Ala Thr Thr Ala Asn 
130 135 140 

cag gat ecc ctt ate act ttg act ecc ate att ggt ett gac ate tgg 480 
Gin Asp Pro Leu He Thr Leu Thr Pro He He Gly Leu Asp He Trp 
145 150 155 160 

gag cac get ttc tac etc cag tac aag aat gtc aag cct gat tac ctt 528 
Glu His Ala Phe Tyr Leu Gin Tyr Lys Asn Val Lys Pro Asp Tyr Leu 
165 170 175 

gcc get ttc tgg aac gtc tge aac ttt get gag get cag cga agg ttt 576 
Ala Ala Phe Trp Asn Val Cys Asn Phe Ala Glu Ala Gin Arg Arg Phe 
180 185 190 

gat get get gte aag get taa 597 
Asp Ala Ala Val Lys Ala 
195 



<210>7 
<211> 198 
<212> PRT 

<213> Phaffia rhodozyma 
<400> 7 

Met Ala Pro Tyr Thr Leu Pro Asp Leu Pro Tyr Ala Tyr Asp Ala Leu 
1 5 10 15 

Glu Pro Tyr He Ser Lys Glu He Met He Leu His His Ser Lys His 
20 25 30 

His Gin Thr Tyr Val Thr Asn Leu Asn Ala Ala He Gin Ala Phe Ser 
35 40 45 

Gin Thr Asn Asp He Lys Ala Gin He Ala Leu Gin Ser Ala Leu Lys 
50 55 60 



Phe Asn Gly Gly Gly His He Asn His Ser Leu Phe Trp Lys Asn Met 
65 " 70 75 80 



Ala Pro Ala Asp Scr Ala Asp Ala Lys Leu Thr Glu Gly Scr Leu Lys 
85 90 95 

Thr Ala lie Asp Lys Asp Phe Gly Ser Phe Glu Glu Phe Lys Lys Lys 
100 105 110 

Phe Asn Thr Ala Thr Leu Gly Val Gin Gly Ser Gly Trp Gly Trp Leu 
115 120 125 

Gly Tyr Asn Thr Ala Thr Lys His Leu Glu lie Ala Thr Thr Ala Asn 
130 135 140 

Gin Asp Pro Leu He Thr Leu Thr Pro He He Gly Leu Asp He Trp 
145 150 155 160 

Glu His Ala Phe Tyr Leu Gin Tyr Lys Asn Val Lys Pro Asp Tyr Leu 
165 170 175 

Ala Ala Phe Trp Asn Val Cys Asn Phe Ala Glu Ala Gin Arg Arg Phe 
180 185 190 

Asp Ala Ala Val Lys Ala 
195 



<210> 8 
<211>714 
<212> DNA 

<21 3> Phaffia rhodozyma 

<220> 

<221>CDS 

<222>(1)..(714) 

<400> 8 

tec gga age tea gat acc get cga gat cct cga ggl ttc tot ctt aag 48 
Ser Gly Ser Ser Asp Thr Ala Arg Asp Pro Arg Gly Phe Scr Leu Lys 
15 10 15 

gtc aag acc tct gag gga aac tgg gac ttt gtc gga aac aac act ccc 96 
Val Lys Thr Ser Glu Gly Asn Trp Asp Phe Val Gly Asn Asn Thr Pro 
20 25 30 



ate ttt ttc ttg aga gac cca gcc aag ttt ccg ate tte att eae aee 1 44 
He Phe Phe Leu Arg Asp Pro Ala Lys Phe Pro He Phe He His Thr 
35 40 45 

eag aag agg aae ccg cag aca aac tct aaa gac aag gac get tte tgg 192 
Gin Lys Arg Asn Pro Gin Thr Asn Ser Lys Asp Lys Asp Ala Phe Trp 
50 55 60 

gac tae eta tec eaa aac ccc gag tee gtg cat cag gtg ctg eae etg 240 
Asp Tyr Leu Ser Gin Asn Pro Glu Ser Val His Gin Val Leu His Leu 
65 70 75 80 

ttc agt gat cga gga acc cct get tct tac ega cac atg eat ggt tac 288 
Phe Ser Asp Arg Gly Thr Pro Ala Ser Tyr Arg His Met His Gly Tyr 
85 90 95 

tct gga cac acc ttc aag atg gtc aac agg aac ggt gac tgg aat tat 336 
Ser Gly His Thr Phe Lys Met Val Asn Arg Asn Gly Asp Trp Asn Tyr 
100 105 110 

gtc cag att cac atg cgc acc gat cag ggt gtc aag act cac acc aat 384 
Val Gin He His Met Arg Thr Asp Gin Gly Val Lys Thr His Thr Asn 
115 120 125 

gaa gag get teg aaa etc gac gcc tec aat ccc gat tea aac gga gac 432 
Glu Glu Ala Ser Lys Leu Asp Ala Ser Asn Pro Asp Ser Asn Gly Asp 
130 135 140 

gac ttg ttc gac gca ate aag aat gga gac ttc cct age tgg aeg gtt 480 
Asp Leu Phe Asp Ala He Lys Asn Gly Asp Phe Pro Ser Trp Thr Val 
145 150 ' 155 160 

cag gta cag gta atg tct cct gag cag gcc cag aag ttc aga tac aac 528 
Gin Val Gin Val Met Ser Pro Glu Gin Ala Gin Lys Phe Arg Tyr Asn 
165 170 175 

att ctg gat etc acc aag gtc tgg tec cac aag gag ttc cca ctt agg 576 
He Leu Asp Leu Thr Lys Val Trp Ser His Lys Glu Phe Pro Leu Arg 
180 185 190 

aeg att gga aag ttc act ttg aac ega aac gtg gat aac tat ttc gea 624 
Thr He Gly Lys Phe Thr Leu Asn Arg Asn Val Asp Asn Tyr Phe Ala 
195 200 205 

gag gtt gaa cag etc gcc ttt get cct tee cat ctg cct cct gga ate 672 



Glu Val Glu Gin Leu Ala Phe Ala Pro Ser His Leu Pro Pro Gly He 
210 215 220 



gag ccc teg aae gat ccc gtc ctt cag get ega eta tte tec 7 
Glu Pro Ser Asn Asp Pro Val Leu Gin Ala Arg Leu Phe Ser 
225 230 235 



<210>9 
<211>238 
<2I2> PRT 

<213> Phaffia rhodozyma 
<400> 9 

Ser Gly Ser Ser Asp Thr Ala Arg Asp Pro Arg Gly Phe Ser Leu Lys 
15 10 15 

Val Lys Thr Ser Glu Gly Asn Trp Asp Phe Val Gly Asn Asn Thr Pro 
20 25 30 

He Phe Phe Leu Arg Asp Pro Ala Lys Phe Pro He Phe He His Thr 
35 40 45 

Gin Lys Arg Asn Pro Gin Thr Asn Ser Lys Asp Lys Asp Ala Phe Trp 
50 55 60 

Asp Tyr Leu Ser Gin Asn Pro Glu Ser Val His Gin Val Leu His Leu 
65 70 75 80 

Phe Ser Asp Arg Gly Thr Pro Ala Ser Tyr Arg His Met His Gly Tyr 
85 90 95 

Ser Gly His Thr Phe Lys Met Val Asn Arg Asn Gly Asp Trp Asn Tyr 

100 105 no 

Val Gin He His Met Arg Thr Asp Gin Gly Val Lys Thr His Thr Asn 
115 120 125 

Glu Glu Ala Ser Lys Leu Asp Ala Ser Asn Pro Asp Ser Asn Gly Asp 
130 135 140 

Asp Leu Phe Asp Ala He Lys Asn Gly Asp Phe Pro Ser Tip Thr Val 
145 150 155 160 

Gin Val Gin Val Met Ser Pro Glu Gin Ala Gin Lys Phe Arg Tyr Asn 



165 



170 175 



lie Leu Asp Leu Thr Lys Val Tip Ser His Lys Glu Phe Pro Leu Arg 
180 185 190 

Thr He Gly Lys Phe Thr Leu Asn Arg Asn Val Asp Asn Tyr Phe Al 
195 200 205 

Glu Val Glu Gin Leu Ala Phe Ala Pro Ser His Leu Pro Pro Gly He 
210 215 220 

Glu Pro Ser Asn Asp Pro Val Leu Gin Ala Arg Leu Phe Ser 
225 230 235 



<210> 10 
<211>23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial SequenceiSodl (sense 
primer for cloning of SOD genes) 

<400> 10 

aarcaycayc aracntaygt naa 2: 



<210> 11 
<211>23 
<212>DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sod4 (antisense 
primer for cloning of SOD genes) 

<400> 11 

gcccanccng anccytgnac ncc 23 

<210> 12 
<211> 26 
<212> DNA 

<213> Artificial Sequence 



<220:> 

<223;> Description of Artificial Sequence:Sodl4 (sense 
pnmer for the construction of SODl -disrupting 
plasmid) 

<400> 12 

ggtacctccg atgataggaa tgtgag 26 

<210> 13 
<211> 26 
<2 1 2> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Sodl5 
(antisense primer for the constmction of 
SODl -disrupting plasmid) 

<400> 13 

gaattcagtt caacggagga ggacac 2( 



<2]0> 14 
<211>26 
<2]2>DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Sod47 (sense 
primer for the constmction of S0D2-disrupting 
plasmid) 

<400> 14 

gaattcggag gaggacacat caaccg 2( 

<':210> 15 
<211>26 
< 212> DNA 

<213> Artificial Sequence 
<220> 

<^223> Description of Artificial Sequence:Sod48 



(anlisense primer for the construction of" 
S0D2-disrupting plasmid) 

<400> 1 5 

ggtacctgta ctggaggtag aaagcg 



<210> 16 
<211>23 
<212>DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence:Sod2 (sense 
primer for cloning of CAT gene) 

<400> 16 

mgnttytcna cngtnggngg nga 23 

<210> 17 
<211>23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Cat5 (antisense 
primer for cloning of CAT gene) 



<400> 17 

ckrtgnckyt gngtrtcngg rta 



23 



/ 
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<210> 1 
<211> 3632 
<212> DNA 

<2 1 3> Phaffia rhodozyma 



<400> 1 

Icctgttgat aatctttcta acgccttgta ctttgaccaa ggcgtttgtc cgaaattttg 60 
caaacttagt gttggtcgca tggacggtct tcggatccag aactgacggc tcgccaataa 1 20 
agtatgacga tggtagaggt gaaggaggga accacaggtt gaccagtctc aaagagtgct 1 80 
gatgtgcgcg aggatttgtc attaaatggt gttgtatatg ctagagccaa gagaagacat 240 
ttggttttgg ttttggtttt gcatttgatg agatgtgtca cgattgaaga cgggaggagg 300 
ctcactaacc caagaagcca ggatcaggag gaatgcctcc cccttttcat caagatcttt 360 
ctcacatcga acatttgaca ttctctttag tatccttcta tccttttctt ccaacttctc 420 
ccattgtatc gactttgctc gacttgctct tcttatctct gagcagagat gggcattcca 480 
atatcgaagg agcgacacaa gaccttggag tttgggtaac agatgaagag gggccgaggt 540 
ggatggggct gtaggaagta gctgatcgat gagttcctgg atgatgatag gcgaaggaac 600 
agacatagga tctctgtctc gtcctggaat tactgagtct tgtatccagc gtgttcttgt 660 
ctcgaagaag ccttcaagat cgatgtaaga taagacaggc aatgaggacg gacgaacgaa 720 
cgaacgaaaa gaacagaaga gctggtaagt cagtcagtca gtcagtcagt caatcaaaca 780 
ctggtgtcta gggttatagc tcgacgcgac gcgacgcgtt tgagacgcga tatgcttacg 840 
taatacctgg cgtcatcccc ccagccgagg caagagccga gccgctcgtg aacgacaaaa 900 
ttcaaaaggc tttctccatc ttaagctcat tctcatctaa ccgactcatc tcgttcccat 960 
cattcccatc attctaccgc catccatgtc tgttcgagca tccctctctt ccgtgtctag 1020 
acagactttc gtcgctcctg ctgctttcca gatcagggca aagcataccc tgcctgagct 1 080 
tccttacgct tacgatgtaa gacttttccg tgttctccta ttcgtcgctt tcttggtttt 1 140 
tttcgtcttc gccctctagc tcttcttcgt cctttctgtc ctgctctttg ttgttgatat 1200 
tcagctcgat agactaaccc atctcatctc ctggacattc ttttactgga aacgtatctt 1260 
gtccttggtt tttcttggct ttggttgaaa attcctctcc actcaggccc tggagccctc 1 320 



catctccaag gagatcatga cccttcacca caccaagcac catcagactt atgttaacgg 1 380 
cctcaacgct gccgaggaga gctactcggc cgctgtgggc aaggaggatg tgcttaccca 1 440 
ggttaagctt cagtctgtac gtctgaccgt ttttttatcg accggaacgc ctggtgagga 1 500 
gggagatgaa gtttgatgag cgctcatcgt ctagcacgtt gacccgatca tacaggctct 1 560 
caagttcaac ggaggaggac acatcaatcg tcagtgatat tcttcaaact cttgctgagc 1620 
aagtcaggtc aagctgactg tttcgctttg tttctgcgga tctatctcat ccttgatttg 1 680 
gcatgatgaa acagactctc tgttctggaa gaacttggct ccctatggat ccgaggaggc 1 740 
taccctctct gaaggacctc tcaagaaggc tatcgaggaa tcttttggtt ctttcgaggg 1 800 
tccgtccatc tatcttccta ttcagttgtg tttggttccg gtatactcat ctgttttgtt 1 860 
tccccacaaa ataaaaataa aaatcttgtc ctctccgggg gttcgactgc acgttcatag 1920 
ccttcaagaa gaagttcaac gctgacaccg ctgctgtcca aggatccgga tggggctggc 1 980 
ttgtatgtat catatccttt ccatctcaaa ctcttctcag agtctttttc cttgagactt 2040 
caaactgact atacatgttt ctacaacaaa caacagggct tgaacccgct tactaagaag 2 1 00 
ctggaagtca ccacgaccgc caaccaggac cctctgctta gtaagttgtt tctacatgat 2160 
tttctatctc aacgcgatct gcatgattcg tcactgattc actggattct cttgtttcgt 2220 
ttttctcggg atgatttcat aaacagctca cattcctatc atcggagttg acgtgcgtat 2280 
ctttcttgaa tagtcgtagc gtctgatctc gttttattga ctgacgtgtt gcttctgtcc 2340 
aaatcattaa aaaaaatgaa aacaaataat cgattgaccg acgaaaacag atctgggagc 2400 
acgctttcta ccttcagtac aagaacgtca agcctgacta tctcgctgct gtttggtccg 2460 
ttatcaacta caaggaggca gaggcccgat tgcaggctgc tctctaagcg ggacgaaaag 2520 
taacgacata tgaagggagg atcaaatatc gtttcttcat aaacaacttt cgaggcagat 2580 
gggagagtac gtacaagaga ggtttgtatg gagaattgag tttgttgacg gttagcaggt 2640 
tatgatatat gtagctatag tctagtctaa atctgaaaga agagaacaag atggtttgtc 2700 



cgaagagatt gagagatcaa gcccggtcat ctgatgtcga acaaacatgc cctggtctgc 2760 
caacagtttc tagcacatta tgaccatgtt catgtgtaaa ttgggaaatg agccagaaag 2820 
gtttattatc taattcattg attcatgcga ctatggatac atatgggatt tccagaacaa 2880 
acagatgcaa caaagcacgg cattttccaa agatcgaglc ctcccacaag tatgcggcaa 2940 
ggtttgttgt taagagatat aaaagcagac gacaaaacaa atcgtttatc gaccctgtgc 3000 
accaacaccg tgaccgtttg acgagttggt agagttgtag ttgttgctgt tcaaaggagc 3060 
tccagactgg acgcttccaa gcttcaacaa cttctcggca gcgtcgctgt tcgggaaaag 3120 
aaaaggcaaa aaggaacaga gcgataagca tatgtgattc tctacttctt ataggctctt 3 1 80 
agctcaagtc aactcacatg tctttggcgg taccgaagac gttctcaagc tgctgcttgg 3240 
aagcttttcc gagcttgcca gtaggtccct ggttggagaa gaagatgtcg aaggctaagg 3300 
gcgatgaaaa gcatgaagat attagctatc ggcgcgataa aagtgtgacg agatgaaaat 3360 
ggagaaaaga tgattcgcac catcgacgac ctcgaccaaa ggaatggagg tgtcaccggc 3420 
cttccacttc ttgtactcct caacgttgac gaagatgacg aagcagtcgg tggccttagc 3480 
ctcgggctcg tagatgctga tgaaacacaa taggtagtag gagaggagaa agagaagatg 3540 
atgagatgtc aggatgcttg cttcactgta gatggaggaa gaagatatgc gaagcaagac 3600 
atacactttg gaaagagctt gaaccattgt ag 3632 

<210> 2 
<211> 3375 
<212> DNA 

<213> Phaffia rhodozyma 



<400> 2 



cttatccttc tgccgctggt ctctgtctgt cgagtgtgtg ggatggtttt ggatatgttc 60 
ctatacgaaa ggtagcgcag agcaaagctg acagtattaa gcaagacaag agcttctttc 1 20 
tgttgacaga tgaaaggacg aactatgaag ctgtccatgc tccccaaacc gattgacaca 1 80 
ccgccgtcag gcaacgcaga atttctcact gcttcgacgt cacaccaaca tcgatcctcc 240 
atacctaaaa gcagatcgag acacattgtt ggtcgccatg ttggatggat gtacatcaaa 300 
cccacagcat atatcactca catgtgagaa ctccgtagcc tctaccttct tgtctctcaa 360 
tctgaatgtc tcgttgagag gtggaatgaa tgtttacagt ttgagaagac gaaagaaaga 420 
aagagaagag aagagaggaa tacgtacgac gaagttatca tcgtatggga acttttctaa 480 
aaaactgcct atagtagaga cgatctctgg aggaaagctc tgtagtatga tagtgaagag 540 
cgagcaagtc tgggcaagtg catccttcgt ctacaagaaa gagaccagga aatgaaggag 600 
agacgagtaa gcaggtacct accgatattg gatcgttctc tctacccagc gatgccttca 660 
ccacgcgttc tatctcttct tgggatggca gatacatact taacgagagc aatctgatgt 720 
ataccgaact tcgaacggaa tgatcccaga atcctcttga acccttgaac ccttgaaccc 780 
tggaaccaag taccaaccga gcaacacgcc gatacggtcc acaccacaga accacacgcc 840 
ctcgtcatta aaggtgggac gcgccgatgc tggttacgtt cggcccaatc cggaagttac 900 
cggcttggac gtgcctgtaa ccatgccctg acggtatttc gccttcagct aactccatct 960 
catctttttc ctttactacc acaacccacc cttgaacctt cttccccggc ttttttacta 1020 
tatccatcta tcaatcatca tggctcctta cactcttccc gacgtaagct taaagtttga 1080 
gctgtgtgtg cttatctcaa tcttggagtt gaactcaccg ttttttgttt ttgcttcctg 1 1 40 
gtttttttat cggcatccct cctttttttc ccctcgtggt cgcatatgat ttgctcatca 1200 
atcggcgttt cccatgcatc tttgtcatcc gtttcagctt ccttacgctt acgatgcctt 1260 
ggagccttac atctctaagg gtgagattct tagtcagact gttgttccgg ttcgacacga 1320 
tagctaatcg tctctcgttc ctcaatatga acatgcagaa atcatgatcc ttcaccactc 1380 



caagcaccat cagacttacg tcgtacgtaa tctaaaggtc atctccgtct acatggccgg 1440 
atcaacttgc tcatagatct tccttctgtt cggcgctacg tagaccaacc tcaacgccgc 1 500 
tatccaggct ttctcccaga ccaatgacat caaggcccag atcgctcttc agagcgctct 1 560 
caagttcaac ggaggaggac acatcaaccg tacgatcatt ctccctcttc tggcttatca 1620 
tatgtgttgc ttgtcactaa cacgcatgca accccgggat atctcaccct gtagactccc 1 680 
tcttctggaa gaacatggct cctgccgact ctgctgatgc caagctcacc gagggatcgc 1 740 
tcaagactgc catcgacaag gactttggat ccttcgagga gttcaagaag aagttcaaca 1 800 
ctgctactct cggtgtccag ggatctgtca gtatctcgtt tgcttcgaca tactctcagc 1 860 
tttccttccg taaactgacg aatagttttt cggacatgta cttgtaggga tggggatggc 1 920 
tcgtgcgttt gacctttttc cactttgaac attagcgata gtgataccta acaactgtga 1980 
attggaatat agggatacaa caccgctacc aagcacctcg agatcgccac caccgccaac 2040 
caggatcccc ttatcagtat gtgacttctc tcgtgtggtc accataagcc agttgctgac 2 1 00 
acatttcgtt cgctgtctct cgacttcgta gctttgactc ccatcattgg tcttgacgtt 2 1 60 
agtaattcta tctagtgatt ggagtcgagt tctgaacttg ccttgatctc aaacgaatga 2220 
atcaatttct tttggtagat ctgggagcac gctttctacc tccagtacaa gaatgtcaag 2280 
cctgattacc ttggtacgta attctctatt cgtttgcccc ggtttgatct ttgactcact 2340 
cttcaaaatg ttttcgtttg taactttgaa aaacagccgc tttctggaac gtctgcaact 2400 
ttgctgaggc tcagcgaagg tttgatgtga gtacaggcgc tacccctacg gaggaagcga 2460 
aggtgagctg accacttttt atctttctga tttggaatga acgatccgat gatcaaacaa 2520 
acaggctgct gtcaaggctt aatggtccca tttatctctt tgattcgacg gcgatgacgg 2580 
ctttctcgca tccgaagaag gcaaggctat gattactgtt attctgccat gttgcttgct 2640 
ttgctatgct ctatgttctt ttcttttgcc tctcttcaaa gccaaggcgt taaggaaggc 2700 



ccttcagtct gttttacata tgcacatata catgagaaca tatcacggac tcggcggctg 2760 
gtggtcctct tgagcgtcgg cttcaagatt agtgtccaca cgtgaagcgt tcggtgccat 2820 
ccaacctggt aggaatcccc atcgggcggg aatccaatta tcaattggcg gtcggccaga 2880 
ttcgagctcg ggtatctcag aagcgtcaag cgggcgcatt tccaggcctt taagaggagc 2940 
aaatttaatc cgcctgggtg ttcagcgaga cacgaacagt ttgaaacaga gtctgcttgt 3000 
gagttactcg gcgagatcac tgaggactaa actttctcag ctcgtggacg aaaagaacga 3060 
accaaacggt cttccctgta tctcgaccat ctccttctcc atctcttaca acacctcgga 3 1 20 
tgaactccaa ggcttgcttt ccaaagttca aacaaactcc gggttgccat ccacctggtt 3 1 80 
tgtctctaac gagccgaggg atatccatcg ttcggaacgt ttgaacagac tggatggtag 3240 
gtggccggtc gcttcggaag ccaatcataa tggtgggaat cgagagaagg aatgattggg 3300 
cccagtgttt aagacttgtg tttgttggca gagtacggac ggaaagtagg acagacttaa 3360 
tcaaggcgag ccaag 3375 

<210> 3 
<2ll> 951 
<212> DNA 

<213> Phaffia rhodozyma 
<400> 3 

tccggaagct cagataccgc tcgagatcct cgaggtttgt gtgctttcgc tttgttcgca 60 
tggatgaagc tgttaactta aaaaaatcct cgtgtttctc tttgtttcaa cataggtttc 1 20 
tctcttaagg tcaagacctc tgagggaaac tgggactttg tacgtattct tatcgactga 1 80 
gtcatcaagc tcgttatcgc tctcttaccc tcatcctttt gtgtctctgt ctacacctct 240 
aggtcggaaa caacactccc atctttttct tgagagaccc agccaagttt ccgatcttca 300 



ttcacaccca gaagaggaac ccgcagacaa actctaaaga caaggacgct ttctgggact 360 
accgttcgta taaccttgtc actccctgcg tgccgctctg attcatgttg accttgtctt 420 
tgatataatt ttatagtatc ccaaaacccc gagtccgtgc atcaggtgct gcacctgttc 480 
agtgatcgag gaacccctgc ttcttaccga cacatgcatg gttactctgg acacaccttc 540 
aagatggtca acaggaacgg tgactggaat tatgtccaga ttcacatgcg caccgatcag 600 
ggtgtcaaga ctcacaccaa tgaagaggct tcgaaactcg acgcctccaa tcccgattca 660 
aacggagacg acttgttcga cgcaatcaag aatggagact tccctagctg gacggttcag 720 
gtacaggtaa tgtctcctga gcaggcccag aagttcagat acaacattct ggatctcacc 780 
aaggtctggt cccacaagga gttcccactt aggacgattg gaaagttcac tttgaaccga 840 
aacgtggata actatttcgc agaggttgaa cagctcgcct ttgctccttc ccatctgcct 900 
cctggaatcg agccctcgaa cgatcccgtc cttcaggctc gactattctc c 95 1 



<210> 4 
<211> 669 
<212> DNA 

<213> Phaffia rhodozyma 



<220> 

<221> CDS 

<222> (1)..(666) 

<223> n or X = A, C, G or T 



<400> 4 



atg tct gtt cga gca tec etc tct tec gtg tct aga cag act ttc gtc 48 
Met Ser Val Arg Ala Ser Leu Ser Ser Val Ser Arg Gin Thr Phe Val 
15 10 15 

get cct get get ttc cag ate agg gca aag cat ace ctg ect gag ctt 96 
Ala Pro Ala Ala Phe Gin He Arg Ala Lys His Thr Leu Pro Glu Leu 
20 25 30 

cet tae get tac gat gee ctg gag cec tec ate tec aag gag ate atg 144 
Pro Tyr Ala Tyr Asp Ala Leu Glu Pro Ser He Ser Lys Glu He Met 
35 40 45 

ace ctt cae cae aec aag eac cat cag act tat gtt aae ggc etc aae 192 
Thr Leu His His Thr Lys His His Gin Thr Tyr Val Asn Gly Leu Asn 
50 55 60 

get gee gag gag age tac teg gee get gtg ggc aag gag gat gtg ctt 240 
Ala Ala Glu Glu Ser Tyr Ser Ala Ala Val Gly Lys Glu Asp Val Leu 
65 70 75 80 

ace cag gtt aag ctt cag tct get etc aag ttc aae gga gga gga eac 288 
Thr Gin Val Lys Leu Gin Ser Ala Leu Lys Phe Asn Gly Gly Gly His 
85 90 95 

ate aat cae tet ctg ttc tgg aag aae ttg get cec tat gga tec gag 336 
He Asn His Ser Leu Phe Trp Lys Asn Leu Ala Pro Tyr Gly Ser Glu 
100 105 110 

gag get aec etc tct gaa gga cet etc aag aag get ate gag gaa tct 384 
Glu Ala Thr Leu Ser Glu Gly Pro Leu Lys Lys Ala He Glu Glu Ser 
115 120 125 

ttt ggt tct ttc gag gee ttc aag aag aag ttc aac get gae aec get 432 
Phe Gly Ser Phe Glu Ala Phe Lys Lys Lys Phe Asn Ala Asp Thr Ala 
130 135 140 

get gte caa gga tee gga tgg ggc tgg ctt ggc ttg aae ecg ctt act 480 
Ala Val Gin Gly Ser Gly Trp Gly Trp Leu Gly Leu Asn Pro Leu Thr 
145 150 155 160 

aag aag ctg gaa gte ace aeg aec gee aac cag gae cet ctg ctt act 528 
Lys Lys Leu Glu Val Thr Thr Thr Ala Asn Gin Asp Pro Leu Leu Thr 
165 170 175 



cae att ect ate ate gga gtt gae ate tgg gag eac get ttc tac ctt 576 



His lie Pro He He Gly Val Asp He Trp Glu His Ala Phe Tyr Leu 
180 185 190 



cag tac aag aac gtc aag cct gac tat etc get get gtt tgg tec gtt 624 
Gin Tyr Lys Asn Val Lys Pro Asp Tyr Leu Ala Ala Val Trp Ser Val 
195 200 205 

ate aac tac aag gag gca gag gee ega ttg cag get get etc taa 669 
lie Asn Tyr Lys Glu Ala Glu Ala Arg Leu Gin Ala Ala Leu 
210 215 220 



<210> 5 
<211> 222 
<212> PRT 

<213> Phaffia rhodozyma 
<400> 5 

Met Ser Val Arg Ala Ser Leu Ser Ser Val Ser Arg Gin Thr Phe Val 
15 10 15 

Ala Pro Ala Ala Phe Gin He Arg Ala Lys His Thr Leu Pro Glu Leu 
20 25 30 



Pro Tyr Ala Tyr Asp Ala Leu Glu Pro Ser He Ser Lys Glu He Met 
35 40 45 



Thr Leu His His Thr Lys His His Gin Thr Tyr Val Asn Gly Leu Asn 
50 55 60 



Ala Ala Glu Glu Ser Tyr Ser Ala Ala Val Gly Lys Glu Asp Val Leu 
65 70 75 80 



Thr Gin Val Lys Leu Gin Ser Ala Leu Lys Phe Asn Gly Gly Gly His 



85 90 95 



He Asn His Ser Leu Phe Trp Lys Asn Leu Ala Pro Tyr Gly Ser Glu 
100 105 110 



Glu Ala Thr Leu Ser Glu Gly Pro Leu Lys Lys Ala He Glu Glu Ser 
115 120 125 



Phe Gly Ser Phe Glu Ala Phe Lys Lys Lys Phe Asn Ala Asp Thr Ala 
130 135 140 



Ala Val Gin Gly Ser Gly Trp Gly Trp Leu Gly Leu Asn Pro Leu Thr 
145 150 155 160 



Lys Lys Leu Glu Val Thr Thr Thr Ala Asn Gin Asp Pro Leu Leu Thr 
165 170 175 



His He Pro He He Gly Val Asp He Trp Glu His Ala Phe Tyr Leu 
180 185 190 



Gin Tyr Lys Asn Val Lys Pro Asp Tyr Leu Ala Ala Val Trp Ser Val 
195 200 205 



He Asn Tyr Lys Glu Ala Glu Ala Arg Leu Gin Ala Ala Leu 
210 215 220 



<210> 6 
<211> 597 
<212> DNA 



<213> Phaffiarhodozyma 



<220> 
<221> CDS 
<222> (1)..(594) 
<223> 



<400> 6 

atg get cct tac act ctt ccc gac ctt cct tac get tac gat gee ttg 48 
Met Ala Pro Tyr Thr Leu Pro Asp Leu Pro Tyr Ala Tyr Asp Ala Leu 
15 10 15 

gag cct tac ate tct aag gaa ate atg ate ctt cae cac tec aag cac 96 
Glu Pro Tyr He Ser Lys Glu lie Met He Leu His His Ser Lys His 
20 25 30 

cat cag act tac gtc acc aac etc aac gee get ate cag get ttc tec 144 
His Gin Thr Tyr Val Thr Asn Leu Asn Ala Ala lie Gin Ala Phe Ser 
35 40 45 

cag acc aat gac ate aag gee cag ate get ctt cag age get etc aag 1 92 
Gin Thr Asn Asp lie Lys Ala Gin He Ala Leu Gin Ser Ala Leu Lys 
50 55 60 

ttc aac gga gga gga cac ate aac cac tec etc ttc tgg aag aac atg 240 
Phe Asn Gly Gly Gly His He Asn His Ser Leu Phe Trp Lys Asn Met 
65 70 75 80 

get cet gee gac tct get gat gee aag etc acc gag gga teg etc aag 288 
Ala Pro Ala Asp Ser Ala Asp Ala Lys Leu Thr Glu Gly Ser Leu Lys 
85 90 95 



act gee ate gac aag gac ttt gga tec ttc gag gag ttc aag aag aag iiO 
Thr Ala He Asp Lys Asp Phe Gly Ser Phe Glu Glu Phe Lys Lys Lys 
100 105 110 



ttc aac act get act etc ggt gtc cag gga tct gga tgg gga tgg etc 384 
Phe Asn Thr Ala Thr Leu Gly Val Gin Gly Ser Gly Trp Gly Trp Leu 
115 120 125 



gga tac aac acc get acc aag cac etc gag ate gee acc acc gee aac 4J 
Gly Tyr Asn Thr Ala Thr Lys His Leu Glu He Ala Thr Thr Ala Asn 



130 135 140 

cag gat ccc ctt ate act ttg act ccc ate att ggt ctt gac ate tgg 480 
Gin Asp Pro Leu He Thr Leu Thr Pro He He Gly Leu Asp He Trp 
145 150 155 160 

gag cac get tte tae etc cag tac aag aat gtc aag cct gat tac ctt 528 
Glu His Ala Phe Tyr Leu Gin Tyr Lys Asn Val Lys Pro Asp Tyr Leu 
165 170 175 

gcc get ttc tgg aac gtc tgc aac ttt get gag get cag cga agg ttt 576 
Ala Ala Phe Trp Asn Val Cys Asn Phe Ala Glu Ala Gin Arg Arg Phe 
180 185 190 

S07 

gal get get gtc aag get taa 
Asp Ala Ala Val Lys Ala 
195 



<210> 7 
<211> 198 
<212> PRT 

<213> Phaffia rhodozyma 
<400> 7 

Met Ala Pro Tyr Thr Leu Pro Asp Leu Pro Tyr Ala Tyr Asp Ala Leu 
15 10 15 

Glu Pro Tyr He Ser Lys Glu lie Met He Leu His His Ser Lys His 
20 25 30 

His Gin Thr Tyr Val Thr Asn Leu Asn Ala Ala He Gin Ala Phe Ser 
35 40 45 



Gin Thr Asn Asp He Lys Ala Gin He Ala Leu Gin Ser Ala Leu Lys 
50 55 60 



Phe Asn Gly Gly Gly His He Asn His Ser Leu Phe Trp Lys Asn Met 
65 70 75 80 



Ala Pro Ala Asp Ser Ala Asp Ala Lys Leu Thr Glu Gly Ser Leu Lys 
85 90 95 



Thr Ala He Asp Lys Asp Phe Gly Ser Phe Glu Glu Phe Lys Lys Lys 
100 105 110 



Phe Asn Thr Ala Thr Leu Gly Val Gin Gly Ser Gly Trp Gly Trp Leu 
115 120 125 



Gly Tyr Asn Thr Ala Thr Lys His Leu Glu lie Ala Thr Thr Ala Asn 
130 135 140 



Gin Asp Pro Leu He Thr Leu Thr Pro He He Gly Leu Asp He Trp 
145 150 155 160 



Glu His Ala Phe Tyr Leu Gin Tyr Lys Asn Val Lys Pro Asp Tyr Leu 
165 170 175 



Ala Ala Phe Trp Asn Val Cys Asn Phe Ala Glu Ala Gin Arg Arg Phe 
180 185 190 



Asp Ala Ala Val Lys Ala 
195 



<210> 8 
<211> 714 
<212> DNA 



<213> Phaffia rhodozyma 



<220> 
<221> CDS 
<222> (1)..(714) 
<223> 



<400> 8 

tec gga age tea gat aec get cga gat cct ega ggt tte tct ctt aag 48 
Ser Gly Ser Ser Asp Thr Ala Arg Asp Pro Arg Gly Phe Ser Leu Lys 
15 10 15 

gte aag ace tet gag gga aac tgg gae ttt gtc gga aac aac act eec 96 
Val Lys Thr Ser Glu Gly Asn Trp Asp Phe Val Gly Asn Asn Thr Pro 
20 25 30 

ate ttt tte ttg aga gae eca gee aag ttt ccg ate tte att cac ace 1 44 
lie Phe Phe Leu Arg Asp Pro Ala Lys Phe Pro He Phe He His Thr 
35 40 45 

cag aag agg aac ceg eag aea aae tet aaa gae aag gae get tte tgg 1 92 
Gin Lys Arg Asn Pro Gin Thr Asn Ser Lys Asp Lys Asp Ala Phe Trp 
50 55 60 

gae tae eta tee caa aae eec gag tee gtg eat eag gtg ctg cae etg 240 
Asp Tyr Leu Ser Gin Asn Pro Glu Ser Val His Gin Val Leu His Leu 
65 70 75 80 

tte agt gat ega gga ace cet get tct tae cga cac atg cat ggt tae 288 
Phe Ser Asp Arg Gly Thr Pro Ala Ser Tyr Arg His Met His Gly Tyr 
85 90 95 

tct gga cac ace tte aag atg gtc aae agg aae ggt gae tgg aat tat 336 
Ser Gly His Thr Phe Lys Met Val Asn Arg Asn Gly Asp Trp Asn Tyr 
100 105 110 

gte cag att cac atg cgc ace gat eag ggt gtc aag act cae ace aat 384 
Val Gin He His Met Arg Thr Asp Gin Gly Val Lys Thr His Thr Asn 
115 120 125 



gaa gag get teg aaa etc gae gee tec aat eec gat tea aae gga gae 432 
Glu Glu Ala Ser Lys Leu Asp Ala Ser Asn Pro Asp Ser Asn Gly Asp 
130 135 140 

gae ttg ttc gae gea ate aag aat gga gae tte ect age tgg acg gtt 480 
Asp Leu Phe Asp Ala He Lys Asn Gly Asp Phe Pro Ser Trp Thr Val 
145 150 155 160 

eag gta cag gta atg tct cet gag eag gee cag aag tte aga tae aae 528 
Glu Val Gin Val Met Ser Pro Glu Gin Ala Gin Lys Phe Arg Tyr Asn 
165 170 175 

att ctg gat etc ace aag gtc tgg tee cae aag gag ttc eca ctt agg 576 
He Leu Asp Leu Thr Lys Val Trp Ser His Lys Glu Phe Pro Leu Arg 
180 185 190 

acg att gga aag ttc act ttg aae cga aae gtg gat aae tat ttc gea 624 
Thr He Gly Lys Phe Thr Leu Asn Arg Asn Val Asp Asn Tyr Phe Ala 
195 200 205 

gag gtt gaa eag cte gee ttt get ect tec cat etg ect cet gga ate 672 
Glu Val Glu Gin Leu Ala Phe Ala Pro Ser His Leu Pro Pro Gly He 
210 215 220 

gag cce teg aae gat eec gtc ctt eag get cga eta tte tec 714 
Glu Pro Ser Asn Asp Pro Val Leu Gin Ala Arg Leu Phe Ser 
225 230 235 



<210> 9 
<211> 238 
<212> PRT 

<213> Phaffia rhodozyma 



<400> 9 



Ser Gly Ser Ser Asp Thr Ala Arg Asp Pro Arg Gly Phe Ser Leu Lys 
15 10 15 



Val Lys Thr Ser Glu Gly Asn Trp Asp Phe Val Gly Asn Asn Thr Pro 
20 25 30 



He Phe Phe Leu Arg Asp Pro Ala Lys Phe Pro He Phe He His Thr 
35 40 45 



Gin Lys Arg Asn Pro Gin Thr Asn Ser Lys Asp Lys Asp Ala Phe Trp 
50 55 60 



Asp Tyr Leu Ser Gin Asn Pro Glu Ser Val His Gin Val Leu His Leu 
65 70 75 80 



Phe Ser Asp Arg Gly Thr Pro Ala Ser Tyr Arg His Met His Gly Tyr 
85 90 95 



Ser Gly His Thr Phe Lys Met Val Asn Arg Asn Gly Asp Trp Asn Tyr 
100 105 110 



Val Gin He His Met Arg Thr Asp Gin Gly Val Lys Thr His Thr Asn 
115 120 125 



Glu Glu Ala Ser Lys Leu Asp Ala Ser Asn Pro Asp Ser Asn Gly Asp 
130 135 140 



Asp Leu Phe Asp Ala lie Lys Asn Gly Asp Phe Pro Ser Trp Thr Val 
145 150 155 160 



Gin Val Gin Val Met Ser Pro Glu Gin Ala Gin Lys Phe Arg Tyr Asn 
165 170 175 



lie Leu Asp Leu Thr Lys Val Trp Ser His Lys Glu Phe Pro Leu Arg 
180 185 190 



Thr He Gly Lys Phe Thr Leu Asn Arg Asn Val Asp Asn Tyr Phe Ala 
195 200 205 

Glu Val Glu Gin Leu Ala Phe Ala Pro Ser His Leu Pro Pro Gly He 
210 215 220 

Glu Pro Ser Asn Asp Pro Val Leu Gin Ala Arg Leu Phe Ser 
225 230 235 



<210> 10 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Sodl (sense primer for cloning of SOD genes) 
<220> 

<221> misc_feature 

<222> (1)..(23) 

<223> norX = A, C, GorT 



<400> 10 

aarcaycayc aracntaygt naa 



<210> 11 
<211> 23 
<212> DNA 



23 



<213> Artificial Sequence 



<220> 

<223> Sod4 (antisense primer for cloning of SOD genes) 
<220> 

<221> misc feature 

<222> (1)..(23) 

<223> norX = A, C, GorT 

<400> 11 

gcccanccng anccytgnac ncc 

<210> 12 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Sodl4 (sense primer for the construction of SOD 1 -disrupting plas 
mid) 

<400> 12 

ggtacctccg atgataggaa tgtgag 

<210> 13 
<211> 26 



<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Sodl5 (antisense primer for the coristruction of SOD 1 -disrupting p 
lasmid) 

<400> 13 

gaattcagtt caacggagga ggacac 26 

<210> 14 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Sod47 (sense primer for the contraction of S0D2-disrupting plasmi 
d) 

<400> 14 

gaattcggag gaggacacat caaccg 26 

<210> 15 
<211> 26 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Sod48 (antisense primer for the construction of S0D2-disrupting p 
lasmid) 

<400> 15 

ggtacctgta ctggaggtag aaagcg 26 

<210> 16 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Sod2 (sense primer for cloning of CAT gene) 
<220> 

<221> misc_feature 

<222> (1)..(23) 

<223> n or X = A, C, G or T 

<400> 16 

mgnttytcna cngtnggngg nga 23 

<210> 17 
<211> 23 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Cats (antisense primer for cloning of CAT gene) 
<220> 

<221> misc_feature 

<222> (1)..(23) 

<223> n or X = A, C, G or T 



<400> 17 

ckrtgnckyt gngtrtcngg rta 



23 



